Nowcasting economic and social data: when and why search engine data fails, an illustration using Google Flu Trends

نویسندگان

  • Paul Ormerod
  • Rickard Nyman
  • R. Alexander Bentley
چکیده

Obtaining an accurate picture of the current state of the economy is particularly important to central banks and finance ministries, and of epidemics to health ministries. There is increasing interest in the use of search engine data to provide such ’nowcasts’ of social and economic indicators. However, people may search for a phrase because they independently want the information, or they may search simply because many others are searching for it. We consider the effect of the motivation for searching on the accuracy of forecasts made using search engine data of contemporaneous social and economic indicators. We illustrate the implications for forecasting accuracy using four episodes in which Google Flu Trends data gave accurate predictions of actual flu cases, and four in which the search data over-predicted considerably. Using a standard statistical methodology, the Bass diffusion model, we show that the independent search for information motive was much stronger in the cases of accurate prediction than in the inaccurate ones. Social influence, the fact that people may search for a phrase simply because many others are, was much stronger in the inaccurate compared to the accurate cases. Search engine data may therefore be an unreliable predictor of contemporaneous indicators when social influence on the decision to search is strong.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Nowcasting with Google Trends: a keyword selection method

Search engines, such as Google, keep a log of searches entered into their websites. Google makes this data publicly available with Google Trends in the form of aggregate weekly search term volume. Aggregate search volume has been shown to be able to nowcast (i.e. compute real-time assessment of current activity) a variety of variables such as influenza outbreaks, financial market fluctuations, ...

متن کامل

Adaptive nowcasting of influenza outbreaks using Google searches

Seasonal influenza outbreaks and pandemics of new strains of the influenza virus affect humans around the globe. However, traditional systems for measuring the spread of flu infections deliver results with one or two weeks delay. Recent research suggests that data on queries made to the search engine Google can be used to address this problem, providing real-time estimates of levels of influenz...

متن کامل

Correlation between National Influenza Surveillance Data and Google Trends in South Korea

BACKGROUND In South Korea, there is currently no syndromic surveillance system using internet search data, including Google Flu Trends. The purpose of this study was to investigate the correlation between national influenza surveillance data and Google Trends in South Korea. METHODS Our study was based on a publicly available search engine database, Google Trends, using 12 influenza-related q...

متن کامل

Forecasting the Incidence of Dementia and Dementia-Related Outpatient Visits With Google Trends: Evidence From Taiwan

BACKGROUND Google Trends has demonstrated the capability to both monitor and predict epidemic outbreaks. The connection between Internet searches for dementia information and dementia incidence and dementia-related outpatient visits remains unknown. OBJECTIVE This study aimed to determine whether Google Trends could provide insight into trends in dementia incidence and related outpatient visi...

متن کامل

Macroeconomic Nowcasting Using Google Probabilities∗

Many recent papers have investigated whether data from internet search engines such as Google can help improve nowcasts or short-term forecasts of macroeconomic variables. These papers construct variables based on Google searches and use them as explanatory variables in regression models. We add to this literature by nowcasting using dynamic model selection (DMS) methods which allow for model s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1408.0699  شماره 

صفحات  -

تاریخ انتشار 2014